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© Cysteine added variants ("CAVs") of EPO are provided having one or more cysteine residues substituted for 
selected naturally occurring amino acid residues, or inserted into the polypeptide sequence, and preferably 
being further modified by deletion of certain N-terminal amino acids. Such CAVs may be additionally modified 
by the coupling of sutfhydryl reactive compounds to the introduced cysteine residue(s) without loss of bioactrvity 
to produce selected homogeneously modified EPO and improved pharmaceutical compositions containing the 
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1 io 
Met Ala Pro Pro Arg L*u lie Cyo Asp Ser Ar«j Val 
5'ATATG CCA CCA CCA A6A TTA ATT TGT CAT TCT ACA CTA 
TAC OCT GGT GCT TCT AAT TAA ACA CTA AGA TCT CAT 



Lau Clu Arg Tyr Leu Leu Clu Ala Lya Clu Ala Cla Asn 
TTA CAA CCG TAC CTC TTC GAG CCC AAC GAG GCC GAG AAT 
AAT CTT CCC ATG GAG AAC CTC CGG TTC CTC CCG CTC TTA 

lie Thr Thr Gly eye Ala Glu Bis Cys 8ar Leo Asn Gin 

ATC ACG ACC GCC TGT OCT GAA CAC TCC ACC TTG AAT GAG 
TAG TCC TGC CCG ACA CCA CTT CTC ACG TCC AAC TTA CTC 



Asn He Tar Val Pro Asp Thr Lye Val Asn Phe Tyr Ala 

AAT ATC ACT GTC CCA GAC ACC AAA CTT AAC TTT TAC GCG 

TTA TAG TCA CAG GGT CTC TGG TTT CAA TTG AAA ATG CCC 

55 

Trp Lys Arg Met Clu Vel Gly Gin Gin Ala Val Clu Val 

TGG AAA AGA ATG CAA CTT CCC CAC CAC OCT OTA GAA CTA 

ACC TTT TCT TAC CTT CAA CCG CTC GTC CCA CAT CTT CAT 



Trp Gin Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg 

TGG CAG GGA TTA CCG CTA TTA ACT GAA GCT CTT CTC CGC 
ACC GTC OCT AAT CCC GAT AAT TCA CTT CGA CAA GAG CCG 



Gly Gin Ala Leu Leu Val Asn Ser Ser Gin Pro Trp Glu 
GGT CAG GCT TTA TTA CTC AAC TCT TCC CAC CCG TCC GAC 
CCA GTC CGA AAT AAT CAG TTG AGA ACG GTC GCC ACC CTC 



Pro Leu Gin Leu His Val Asp Lys Ala Val Ser Gly Leu 
CCC CTC CAC CTC CAT CTC GAT AAA GCC GTC AGT CCC CTT 
GCG GAT CTC CAC GTA CAC CTA TTT CCC CAG TCA CCC GAA 

115 

Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu Gly Ala Gin 
CCC ACC CTC ACC ACT CTC CTT CCC GCT CTC GGA CCC CAG 
GCG TCC GAC TCC TCA GAC CAA CCC CGA CAC CCT CCC GTC 



Lys Glu Ala He Ser Pro Pro Asp Ala Ala Ser Ala Ala 
AAC GAA GCC ATC TCC CCT CCA GAT GCG GCC TCA GCT GCT 
TTC CTT CGG TAG AGG CCA GGT CTA CGC CGG ACT CGA CGA 

130 

Pro Leu Arg Thr He Thr Ala Asp Thr Pbe Arg Lys Leu 
CCA CTC CGA ACA ATC ACT CCT GAC ACT TTC CGA AAA CTC 
GGT GAG GCT TCT TAG TCA CGA CTC TCA AAC GCT TTT GAG 



Phe Arg Val Tyr Ser Asn Phe Leu Arg Gly Lys Leu Lys 
TTC CCA GTC TAC TCC AAT TTC CTC CGG CGA AAO CTC AAC 
AAC GCT CAG ATG ACG TTA AAC GAG GCC CCT TTC GAC TTC 



Leu Tyr Thr Gly Glu Ala Cys Arg Thr Gly Asp Arg 

CTG TAC ACA CGG GAC GCC TGC AGG ACA GGG GAC ACA 
GAC ATG TGT CCC CTC CGG ACG TCC TGT CCC CTG TCT 



STOP 

TAA TAATGATAGCATCCT 

ATT ATTACTATCCTACCACATC - 
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TECHNICAL FIELD 

This invention relates generally to polypeptides modified by the attachment of compounds having 
suKhydryl reactive groups, improved methods for producing such modified polypeptides and improved 
5 compositions containing them. The invention relates particularly to three modified polypeptides (IL-3, G-CSF 
and EPO), to which sulfhydryl reactive compounds, including polymers, may be attached at selected 
positions in the polypeptide that have been modified by the insertion of cysteine residues or the substitution 
of cysteine residues for other residues. 

w BACKGROUND 

The desirability of modifying biologically active and therapeutically useful polypeptides with a variety of 
compounds, such as the hydrophilic polymer polyethylene glycol (PEG), to enhance their pharmacokinetic 
properties has been noted. See, e.g., the discussion of the art in this area of polypeptide modification in 

75 published PCT patent application WO87/00056, in U.S. Pat. No. 4,179,337, which discloses conjugating 
water soluble polypeptides such as enzymes and insulin to PEG or PPG, and in U.S. Pat. No. 4,766,106, 
which discloses conjugating ordinarily water insoluble beta-interferon, interieukin-2, or immunotoxins to PEG 
homopolymers or polyoxyethylated glycerol. Such modification can reduce adverse immune response to 
the polypeptide, increase the solubility for use in pharmaceutical preparations and maintain a desirable 

20 circulatory level of such polypeptide for therapeutic efficacy. 

One problem not addressed by the art in this area involves the extent to which a polypeptide can be 
modified by attachment of compounds having reactive groups that will covalently bond to certain amino 
acid residues of the polypeptide. For example, modification of a polypeptide with PEG or similar polymers, 
can result in random attachment of the polymer at the amino terminus of the polypeptide and/or at one or 

25 more lysine residues in the amino acid sequence of the protein. Because more than one PEG group can 
attach to the polypeptide, the resultant composition may contain a heterogeneous mixture of "PEGylated" 
polypeptide; some polypeptides having only one PEGylated site, others having more than one PEGylated 
site. Such heterogeneity in composition is undesirable for pharmaceutical use. Furthermore, the non- 
specificity with regard to the srte(s) of attachment of compounds such as PEG to the polypeptide can result 

30 in loss of biological efficacy of the polypeptide stemming from undesirable attachment to a polypeptide site 
required for biological activity. United States Patent 4,904,584 addresses the foregoing by providing 
materials and methods for site specific covalent modification of polypeptides by lysine insertion, removal, 
and/or replacement. However, we have determined that the use of lysine as the attachment site for 
modification, for example, by PEGylation, may be disadvantageous because not all modifications may result 

35 in biologically active compounds and because steps must be taken to prevent PEGylation at N-termini in 
cases where N-terminal PEGylation is not desired. 

SUMMARY OF THE INVENTION 

40 This invention provides materials and methods for site specific covalent modification of polypeptides, 
particularly and preferably human IL-3, granulocyte colony stimulating factor (G-CSF) and erythropoietin 
(EPO) polypeptides, permitting the production of compositions comprising homogeneously cys modified 1L- 
3s, G-CSFs and EPOs and pharmaceutical compositions containing the same. "Homogeneously cys 
modified" as the term is used herein means substantially consistently modified only at specific, inserted or 

45 substituted cysteine residues. A homogeneously modified IL-3 for example, includes an IL-3 composition 
which is substantially consistently modified at position 6 (using the convention of counting from the N- 
terminus of the mature protein) by the insertion of cysteine in place of the threonine of natural IL-3, but not 
at other positions. 

Thus, this invention first provides cysteine added variants ("CAVs") of IL-3, G-CSF and EPO. CAVs of 
50 this invention encompass IL-3, G-CSF and EPO muteins that contain at least one additional cysteine residue 
compared to the corresponding naturally occurring or previously known IL-3, G-CSF and EPO. The cysteine 
residue(s) are introduced into the peptide structure of the CAVs at one or more amino acid positions in the 
natural or previously known counterpart 

In the case of human IL-3, we have determined that the naturally occurring cysteine residues at 
55 positions 16 and 84 form a disulfide bridge, essential to preserving the desired biological activity of the 
polypeptide. For the addition of novel cysteines, some positions within the polypeptide, such as position 15 
and 51 are unsuitable; cysteines introduced at these positions give rise to human IL-3 polypeptides with 
substantially reduced biological activity. However, certain substitutions or deletions of residues 1-14 do not 
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significantly diminish the desired biological activity of IL-3. Therefore, a preferred region of novel cysteine 
introduction into the polypeptide is within positions 1-14 inclusive. Currently, positions 6-12 inclusive are 
especially preferred sites for cysteine introduction. The subsequent attachment of sulfhydryl reactive 
compounds, including polymers, as discussed below, to the novel cysteines added at selected positions 

5 within this region will not result in any significant loss of biological activity. 

By "cysteine added variant" as the term is used herein, we mean variants of IL-3, G-CSF and EPO that 
are modified in amino acid structure relative to naturally occurring or previously known counterparts such 
that at least one cysteine residue is inserted into the natural or previously known sequence and/or is used 
to replace a different amino acid within that sequence. 

to Additionally, with respect to IL-3, the native or "natural" IL-3 sequence, with an added initiator 
methionine for bacterial expression, may be further modified such that the first alanine is deleted at the N- 
terminus of the mature polypeptide, altering the amino terminal sequence from METALA*PRO to Mb I "PRO 
(the "mp" mutein). For the "mp" mutein, such N-terminus modification permits more consistent removal of 
the N-terminal methionine. As is already known, in bacterial expression systems, cleavage at the N-terminaJ 

15 methionine occurs. Ukewise, the native EPO N-terminal sequence (with the added MET) begins 
METALA*PRO and it may prove advantageous to delete the first alanine to obtain an mpEPO mutein. With 
regard to G-CSF, the natural human N-terminal sequence begins with METTHRTRO (with the MET added 
for bacterial production) and it may be desirable to delete this N-terminal threonine to advantageously 
obtain a mpG-CSF mutein. 

20 Alternatively, the natural IL-3 sequence may be further modified such that the first two amino acids at 
the N-terminus of the mature polypeptide are deleted, leaving a terminus beginning with 
METTHR'GLNTHR* (the "m3" mutein). For the "m3" mutein, such N-terminus modification permits one to 
take advantage of the methionine at position 3 in the naturally occurring human IL-3 molecule, as the 
initiator methionine. 

25 The CAVs of this invention make it possible to produce homogeneous, biologically active IL-3, G-CSF 
and EPO compositions substantially specifically and consistently modified at selected positions with 
sulfhydryl reactive compounds (described hereinafter). 

In the practice of this invention, at least one cysteine residue is introduced in that portion of the IL-3, G- 
CSF or EPO polypeptide where modification via a sulfhydryl reactive compound is desired. The cysteine 

30 residue or residues are so introduced by genetic engineering methods as described below. Novel cysteine 
residues may be engineered into the polypeptide for example, by simple insertion of a cysteine codon into 
the DNA molecule at the desired site or by converting a desirably located asparagine or other codon to a 
cysteine codon. Convenient methods for site specific mutagenesis or DNA synthesis for producing a DNA 
molecule encoding the desired CAV. expression in procaryotic or eucaryotic host cells of the DNA molecule 

35 so produced, and recovery of the CAV produced by such expression are also disclosed. 

Tne CAVs of this invention retain useful biological properties of the natural or previously known protein 
and may thus be used for applications identified for the non-modified parent. Modification with such 
sulfhydryl reactive compounds, however, is preferred. Such biologically active, modified CAVs can be 
produced in homogeneous compositions which, rt is contemplated, will provide improved pharmacokinetic 

40 profiles, immunogenicity profiles, and/or solubility characteristics relative to the parent polypeptides. 
Furthermore, CAVs may enable the formation of multimeric forms of the normally monomeric polypeptide 
with the same, albeit improved characteristics. Multimeric CAVs also enable the formation of "hetero- 
conjugates"- i.e., two or more distinct polypeptides joined via the sulfhydryl groups of the added cysteine 
residues. e.g.. IL-3 joined to EPO or IL-3 joined to G-CSF. 

45 Biological activity of the CAVs before or after modification with the sulfhydryl reactive compounds may 
be determined by standard in vitro or in vivo assays conventional for measuring activity of the parent 
polypeptide. Alternatively, we provide herein a "small scale" screening method wherein successful Cys 
modification and attachment of the sulfhydryl reactive compound may be tested. 

Selective and homogeneous modification of the CAVs with sulfhydryl reactive compounds is possible 

so since such compounds will covalently bond primarily only to the cysteine residue(s) in the CAV. Secondary 
reactivity at His, Lys and Tyr residues) may be observed, depending on the choice of sulfhydryl reactive 
compound, but at a significantly lower rate. The modified CAVs so produced may then be recovered, and H 
desired, further purified and formulated into pharmaceutical compositions by conventional methods. 

Sulfhydryl reactive compounds include compounds such as polyalkytene glycol, e.g. polyethylene and 

55 polypropylene glycol, as well as derivatives thereof, with or without coupling agents or derivatization with 
coupling or activating moieties, for example, with thiol, triflate, tresylate, aziridine or oxirane, or preferably 
with S-pyridyl or maleimide moieties. Compounds such as S-Pyridyl Monomethoxy PEG and Maleimido 
Monomethoxy PEG are exemplary. Additionally, sulfhydryl reactive compounds include, but are not limited 
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to, charged or neutral polymers of the following types: dextran, colominlc acids or other carbohydrate based 
polymers, polymers of amino acids and biotin derivatives, resulting in a protein modified with this well 
known affinity reagent often used for antibody based assays. 

Briefly, the method comprises reacting the CAV with a suifhydryl reactive compound under suitable 

s conditions, preferably non-denaturing conditions, and in sufficient amounts permitting the covalent attach- 
ment of the suifhydryl reactive compound to the introduced cysteine residues) present in the polypeptide 
backbone of the CAV. The reaction may be reducible or non-reducible; and generally, the amount of 
suifhydryl reactive compound used should be at least equimolar to the number of cysteines to be 
derivatized, although use of excess suifhydryl reactive compound is preferred, both to improve the rate of 

10 reaction and to insure consistent modification at all reactive sites. The modified CAV produced, may then 
be recovered, purified and formulated by conventional methods. See e.g., WO 87/00056 and references 
cited therein. 

Other aspects of the present invention include therapeutic methods of treatment and therapeutic 
compositions which employ the modified CAVs of the present invention, either alone or with other 
?5 lymphokines, hematopoietins and/or growth factors, such as granulocyte macrophage colony-stimulating 
factor (GM-CSF) , macrophage colony-stimulating factor (M-CSF), IL-1, IL-2, IL-4, IL-5, IL-6 and IL-10. These 
methods and compositions take advantage of the improved pharmacokinetic properties of these modified 
CAVs to provide treatments, e.g.. such as employing lower dosages z of polypeptide, less frequent 
administration, lower immunogenicity and more desirable distribution, required for the therapeutic indica- 
20 tions for the natural polypeptide. 

Other aspects and advantages of the present invention will be apparent upon consideration of the 
following detailed description of the invention, including illustrative examples of the practice thereof. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is the human IL-3 gene construct for E. coli expression, having the polypeptide sequence shown 
of natural (wild type) human IL-3, plus an initiator methionine, as expressed in E.coli, with the amino acids 
numbered from the N-terminus for reference to the muteins discussed herein. 

Fig. 2 is the human G-CSF gene construct for E. coli expression, having the polypeptide sequence 
30 shown of natural (wild type) human G-CSF, plus an initiator methionine, as expressed in E. coli with the 
amino acids numbered from the N-terminus for reference to the muteins discussed herein. 

Fig. 3 is a chemically synthesized human EPO gene construct for E. coli expression, having the 
polypeptide sequence of natural (wild type) human EPO, plus an initiator methionine, as expressed in E. coli 
with the amino acids numbered from N-terminus for reference to the muteins discussed herein. 

35 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention involves the selective modification of IL-3, G-CSF and EPO for pharmaceutical 
use, to both enhance their pharmacokinetic properties and provide homogeneous compositions for human 

40 therapeutic use. Although human IL-3, DNA and peptide sequences are preferred as the starting point in 
this invention as it relates to IL-3, any primate IL-3 is susceptible to use in the method of the invention, 
given the significant homology between e.g.. human and gibbon species of the protein and DNA. See Lean/ 
et al., Blood (1982) 70: 1343-1348. The method for selectively modifying IL-3, G-CSF and EPO involves 
selecting locations in the polypeptide sequence for the attachment of suifhydryl reactive compounds. This 

45 step may be accomplished by altering the amino acid sequence of the polypeptide by inserting cysteine 
residues at selected sites or by converting selected endogenous residues into cysteine residues. For 
example, the codons AAA or AAG, which code for lysine, can be changed to the codon TGC or TGT, which 
code for cysteine. 

CAVs in accordance with this invention also include allelic variations in the protein sequence, i.e. 

so sequence variations due to natural variability from individual to individual, or with other amino acid 
substitutions or deletions which still retain desirable biological properties of the parent 

All CAVs of this invention may be prepared by expressing recombinant DNA sequences encoding the 
desired variant in host cells, e.g. procaryotic host cells such as E. coli, or eucaryotic host cells such as 
yeast or mammalian host cells, using methods and materials, e.g. vectors, as are known in the art. Host 

55 cells containing and capable of expressing the CAV-encoding DNA are thus encompassed by this invention. 
DNA sequences encoding the variants may be produced synthetically or by conventional site-directed 
mutagenesis of DNA sequences encoding the protein or polypeptide or analogs thereof. Figure 1 shows the 
human IL-3 gene construct inserted in plasmid pAL-hlL3-781 and expressed in the E coli K1 2 strain 



5 



EP 0 668 353 A1 



designated GI586. This strain containing the plasmid was deposited with the American Type Culture 
Collection, 12301 Parklawn Drive, Rockville. Maryland 20852 USA on April 19, 1989 and given accession 
number 67932. Other DNA sequences for natural primate IL-3 have been cloned and the DNA sequences, 
including cDNA sequences, and specific peptide sequences for the same have been published, in PCT 
application number US87/01702, published as WO 88/00598 on January 28, 1988, and are therefore known 
in the art These DNA sequences have been deposited with the American Type Culture Collection and 
given accession numbers ATCC 67154, 67326, 67319 and its replacement 68042, and 40246. DNA 
sequences for natural G-CSF and EPO have been cloned and the sequences and their corresponding 
peptide sequences published and are therefore known in the art 

DNA molecules encoding natural human IL-3s, G-CSFs and EPOs therefore may be obtained fi) by 
cloning in accordance with the published methods, (ii) from the deposited plasmids. or (Hi) by synthesis, 
e.g. using overlapping synthetic oligonucleotides based on the published sequences which together span 
the desired coding region. Such methods are known in the art See the foregoing PCT application published 
as WO 88/00598 and PCT application number US88/00402 published as WO88/06161. 

As mentioned above, DNA sequences encoding individual CAVs of this invention may be produced 
synthetically or by conventional site-directed mutagenesis of a DNA sequence encoding the parental 
polypeptides or analogs thereof. Such methods of mutagenesis include the M13 system of Zoller and 
Smith, Nucleic Acids Res. (1982) 10:6487 - 6500; Methods Enzymol. (1983) 1W>:46*500; and DNA (1984) 
3:479^488, which uses single stranded DNA and the method of Morinaga et at. Bio/technology (July 1984) 
636-639, which uses heteroduplexed DNA. Exemplary oligonucleotides used in accordance with such 
methods are described below. It should be understood, of course, that DNA encoding each of the CAVs of 
this invention may be analogously produced by one skilled in the art through site-directed mutagenesis 
using appropriately chosen oligonucleotides. 

The new DNA sequences encoding the CAVs of this invention can be introduced into appropriate 
vectors for heterologous expression in the desired host cells, whether procaryotic or eucaryotic. The activity 
produced by the transiently transfected or stably transformed host cells (or their progeny) may be 
measured by using standard assays conventional for the parental protein. Where the host cell is bacterial, 
the DNA should be free of introns, e.g. a cDNA or synthetic DNA, and may be free of any secretory leader 
sequence. For eucaryotic expression, introns may be present or absent and a secretory leader sequence 
should preferably be present 

The CAVs produced by expression in the genetically engineered host cells may then be purified, and if 
desired formulated into pharmaceutical compositions by conventional methods, often preferably by methods 
which are typically used in purifying and/or formulating the parental protein. R is contemplated that such 
pharmaceutical compositions containing the CAV in admixture with a pharmaceutically acceptable carrier 
will possess similar utilities to those of the parental proteins, such as those set forth in WO 88/00598 supra , 
at page 3. 

In another, and preferred, aspect of this invention, the CAVs produced by recombinant means as 
mentioned above are reacted with the desired sulfhydryl reactive compound under conditions permitting 
attachment of the sulfhydryl reactive moiety to the sulfhydryl group of the introduced cysteine residues in 
the peptide backbone of the CAV. These modified CAVs, preferably produced initially on a small scale, may 
then be screened for bioactive muteins possessing the sulfhydryl reactive compounds attached to the site 
or sites desired. Alternatively, this screening may be accomplished before attachment with the sulfhydryl 
reactive compound. 

The term "sulfhydryl reactive compound" is defined herein as any compound having, or capable of 
being activated to have, a reactive group capable of forming a covalent attachment to the sulfhydryl group 
(-SH) of the cysteine residue. Included among such compounds are polymers such as PEG and poly- 
propylene glycol (PPG), dextran, colominic acids or other carbohydrate based polymers and polymers of 
amino acids and biotin derivatives. Activation may occur by modification of the compound with a sulfhydryl 
moiety, such as a sulfhydryl group, thiol, triflate, tresylate, aziridine or oxirane, or preferably, with S-pyridyl 
or maleimide. The sulfhydryl reactive compound need not have any particular molecular weight, but a 
molecular weight of between about 1 ,000 and 30,000 for the activated compound is preferred, especially for 
PEG. Methods of attachment will be described in detail below. By controlling the number and location of the 
cysteines in the CAV sequence, the number and location(s) of the attached sulfhydryl reactive compound 
can be selectively controlled. Such control of attachment location and number enables the production of 
only certain selectively modified molecules retaining the desired biological activity, rather than production of 
a heterogeneous mixture of variably modified molecules, only some of which may be active. It is also 
important to note that this positional selectivity of the PEGylation or other attachment allows the normal 
functional interactions of the protein to be preserved, blocked, or regenerated by release of the sulfhydryl 
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reactive compound. 

Another aspect of the invention is therefore homogeneous compositions of modified CAVs as described 
herein, e.g. PEGylated CAVs. Specific embodiments of IL-3 CAVs of the invention include human IL-3 which 
has a cysteine residue replacing lysine at position 10 and the m3 initiation sequence. (Amino acid numbers 
for the CAVs of the present invention are used herein in the conventional manner, sequentially from the N- 
terminus, and correlate with the numbering system used in Fig. 1 for the natural human IL-3 as expressed 
in E. coli.) Similarly, the naturally occurring lysine residue in human IL-3 (Fig. 1) at amino acid position 100 
may bTconverted to a cysteine to create a human IL-3 CAV of the invention. Another embodiment has 
cysteine at positions 9 and 10 and the m3 initiation sequence. 

Specific embodiments of G-CSF and EPO CAVs of the invention include human G-CSF which has an 
alanine residue replacing the naturally occurring cysteine residue at position 17 and a cysteine residue 
replacing the naturally occurring alanine at position 37, and human EPO which has a cysteine residue 
replacing serine at position 9. The modification of Ala17 of G-CSF is made to prevent possible improper 
disulfide bridge formation. 

For bacterial expression where the secretory leader-encoding DNA sequence is removed from the CAV- 
encoding DNA, it may be desirable to additionally modify the sequence such that it encodes an N-terrninus 
comprising Met-Prc— (the mp mutein) instead of other N-termini such as Met-AJa-Pro fin IL-3 and EPO) or 
Met-Thr-Pro fin G-CSF). Such N-terminaJ modification permits more consistent removal of the N-terminal 
methionine. AKematively, the first two residues of natural, human IL-3 may be deleted, leaving the naturally 
20 occurring methionine at position 3 as the translation initiator (the m3 mutein). 

CAVs of this invention, modified as described, encompass CAVs containing other modifications as well, 
including truncation of the peptide sequence, deletion or replacement of additional amino acids with amino 
acids other than cysteine, insertion of new N-linked glycosylation sites, abolishment of natural N-linked 
glycosylation sites, etc., so long as the bioactivrty of the molecule is retained. Thus, this invention 
25 encompasses CAVs encoded for by DNA molecules which are capable of hybridizing under stringent 
conditions to the DNA molecule encoding the parental IL-3, G-CSF or EPO (or would be so capable but for 
the use of synonymous codons) so long as the encoded polypeptide contains one or more additional 
introduced cysteine residues relative to the parental peptide sequence. Exemplary stringent conditions can 
be found in T. Maniatis. et a!., Molecular Cloning (A Laboratory Manual) , Cold Spring Harbor Laboratory 
30 (1982), pages 387-389. An example of one such stringent hybridisation condition is 4XSSC at 65°C t 
followed by a washing in 0.1XSSC at 65°C for an hour. Alternatively, an exemplary stringent hybridization 
condition is 50% formamide. 4XSSC at 42°C. 

Because the method and compositions of this invention provide homogeneous modified IL-3s, G-CSFs 
and EPOs, the invention also encompasses such homogeneous compositions for pharmaceutical use which 
35 comprise a therapeutically effective amount of a modified CAV described above in admixture with a 
pharmaceutical^ acceptable carrier. Such composition can be used in generally the same manner as that 
described for the natural or recombinant polypeptides, tt is contemplated that the compositions will be used 
for treating a variety of conditions. e.g. involving stimulating hematopoiests or improving a patients 
hematological profile. For example, a modified IL-3 of the present invention may be used as an adjunct to 
40 cancer chemotherapy, radiation therapy, or in the treatment of immune disorders, as discussed in WO 
88/00598, at page 17-19. The exact dosage and method of administration will be determined by the 
attending physician depending on the particular modified CAV employed, the potency and pharmacokinetic 
profile of the particular compound as well as on various factors which modify the actions of drugs, for 
example, body weight, sex, diet, time of administration, drug combination, reaction sensitivities and severity 
45 of the particular case. Generally, the daily regimen should be in the range of the dosage for the natural or 
recombinant unmodified protein, e.g. a range of about 0.1 to about 100 ug of polypeptide per kilogram of 
body weight preferably from about 0.1 to about 30 ug of polypeptide per kilogram of body weight 

The therapeutic method and compositions of the present invention may also include co-administration 
with other drugs or human factors. A nonexclusive list of other appropriate hematopoietins, CSFs (colony 
stimulating factors) and interleukins for simultaneous or serial co-administration with the CAVs of the 
present invention includes GM-CSF, CSF-1 fin its various known forms; CSF-1 is also referred to as M-CSF 
or macrophage colony-stimulating factor), Meg-CSF. IL-1. IL-2. IL-4, IL-6. IL-10, B-cell growth factor, B-cell 
differentiation factor and eosinophil differentiation factor. Additionally, the CAVs of the present invention may 
be administered with, or chemically attached to, monoclonal or polyclonal antibodies in a therapeutic use. 
55 Alternatively, these growth factors may be attached to certain toxins, e.g.. ricin, for use in a therapeutic 
regimen. The dosage recited above would be adjusted to compensate for such additional components in 
the therapeutic composition or regimen. In the case of pharmaceutical compositions containing modified 
lymphokine CAVs, for example, progress of the treated patient can be monitored by periodic assessment of 
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the hematological profile, e.g. white cell count, hematocrit and the like. 

The following examples illustratively describe the CAVs and the methods and compositions of the 
present invention. 

5 EXPERIMENTAL MATERIALS, METHODS AND EXAMPLES 

EXAMPLE 1: Eucaryotic Expression Materials and Methods 

Eukaryotic cell expression vectors into which DNA sequences encoding CAVs of this invention may be 
jo inserted (with or without synthetic linkers, as required or desired) may be synthesized by techniques well 
known to those skilled in this art. The components of the vectors such as the bacterial replicons, selection 
genes enhancers, promoters, and the like may be obtained from natural sources or synthesized by known 
procedures. See Kaufman et al., J. Mol. Biol. , (1982) 159: 601-621, Kaufman, Proc. Natl. Acad. Sci. (1985) 
82:689-693. See also WO 87/04187, filed January 2, 1987 (pMT2 and pMT2-ADA), and US Patent 
is Application Serial No. 88,188, filed August 21, 1987 (pxMT2). Exemplary vectors useful for mammalian 
expression are also disclosed in the patent applications cited in Example 4, which are hereby incorporated 
by reference. Eucaryotic expression vectors useful in producing variants of this invention may also contain 
inducible promoters or comprise inducible expression systems as are known in the art. See US Patent 
Application Serial No. 893,115 (filed August 1, 1986) and PCT/US87/01871, published as WO88/00975 on 
20 February 11. 1988. 

Established cell lines, including transformed cell lines, are suitable as hosts. Normal diploid cells, cell 
strains derived from in vitro culture of primary tissue, as well as primary explants (including relatively 
undifferentiated cells such as hematopoietic stem cells) are also suitable. Candidate cells need not be 
genotypically deficient in the selection gene so long as the selection gene is dominantly acting. 

If eucaryotic host cells are used, they will preferably will established mammalian cell lines. For stable 
integration of the vector DNA into chromosomal DNA, and for subsequent amplification of the integrated 
vector DNA. both by conventional methods, CHO (Chinese Hamster Ovary) cells are presently preferred in 
such embodiments. Alternatively, the vector DNA may include all or part of the bovine papilloma virus 
genome (Lusky et al., Cell (1984) 36: 391-401) and be carried In cell lines such as C127 mouse cells as a 
stable episomal element. Other usable mammalian cell lines include HeLa, COS-1 monkey cells, melanoma 
cell lines such as Bowes cells, mouse L-929 cells, 3T3 lines derived from Swiss, Balb-c or NIH mice, BHK 
or HaK hamster cell lines and the like. 

Stable transformants then are screened for expression of the CAV product by standard immunological 
or activity assays. The presence of the DNA encoding the CAV polypeptides may be detected by standard 
procedures such as Southern blotting. Transient expression of the CAV genes during the several days after 
introduction of the expression vector DNA into suitable host cells such as COS-1 monkey cells is measured 
without selection by activity or immunologic assay of the proteins in the culture medium. 

Following the expression of the DNA by conventional means, the CAVs so produced may be recovered, 
purified, and/or characterized with respect to physicochemical, biochemical and/or clinical parameters, all by 
40 known methods. 

EXAMPLE 2: Bacterial and Yeast expression 

Bacterial and yeast expression may be effected by inserting (with or without synthetic linkers, as 
45 required or desired) the DNA molecule encoding the desired CAV into a suitable vector (or inserting the 
parental DNA sequence into the vector and mutagenizing the sequence as desired therein), then transform- 
ing the host cells with the vector so produced using conventional vectors and methods as are known in the 
art. e.g. as disclosed in published PCT Application No. WO 86/00639, published January 30, 1986. 
Transformants are identified by conventional methods and may be subcloned H desired. Characterization of 
so transformants and recombinant product so produced may be effected and the product recovered and 
purified, all as described in Example 1. 

For bacterial expression, the DNA sequences encoding the CAVs are preferably modified by conven- 
tional procedures to encode only the mature polypeptide and may optionally be modified to include 
preferred bacterial codons. 

55 



25 



30 



35 



8 



EP 0 668 353 A1 



Expression in E. coli : 

The IL-3 CAVs of Example 5 were expressed in E. coji as follows: Plasmid pAL-hlL3-781 was 
transformed into an E. coli K12 strain GI586, a derivative of strain W3110 in which the C, and Rex regions of 

s bacteriophage lambda carrying the C.857 allele have been inserted into the Clal site of the lacZ gene of the 
bacterial genome. This insert consists of all of the DNA sequences between nucleotides 35711 and 38104 
of the phage genome. See F. Sanger, et aJ., J. Mol. BioL (1982) 162:729. E. coli K12 strain GI586 (pAL-hlL3- 
781) was deposited at the ATCC on April 19, 1989 and given accession number 67932. 

When GI586 transformed with pAL-hlL3-781 is grown at 30 degrees centigrade to high cell density and 

w then heated to 40 degrees centigrade, IL-3 is produced rapidly and accumulates over the next two or three 
hours to reach greater than 10 percent of the total cellular protein. This protein is produced in an insoluble 
form which must be solubilized and refolded by conventional methods. See, e.g., T.E. Creighton, Prog. 
Biophys. Motec. Biol. (1978) 33:231-297. Following expression, the CAVs so produced were recovered, 
purified and characterized as follows. 

75 The G-CSF CAV of Example 5 and the EPO CAV of Example 6 were expressed similarly in E coli by 
removing the DNA sequence encoding CAV IL-3 from pAL-hlL3-781 and inserting the appropriate G-CSF or 
EPO DNA sequence as set forth in those Examples. Transformations were carried out under the same 
conditions set forth above. 

20 1. Purification of CAV IL-3 

All buffers were prepared using glass distilled water; all were degassed for at least five minutes, using 
house vacuum/sonrfication, prior to the addition of DTT. 

First. 400 grams wet weight frozen E. coli cell paste was suspended in 2500 ml of buffer containing 50 

25 mM Tris-HCJ, pH 8.5, 1 mM EDTA, 5 mM P-aminobenzamidine, 1 mM PMSF and 2 mM DTT (hereinafter in 
this Example "buffer A"), to obtain a final volume of 2850 ml. Glass rods and magnetic stirrers were used to 
resuspend the cell paste. Then the cell suspension was lysed by passing it through a matin gaulin valve at 
9000 psi four times, with cooling between each time. Temperature was maintained below 30 degrees 
centigrade by collection of the lysate into glass vessels cooled in ice/water mixture. Protein concentration 

30 was 22 mg/ml; final volume was 2850 ml. 

The lysate was centrrfuged for 30 minutes at 8000 rpm in a Sorval centrifuge with a GS-3 rotor. The 
supernatant (2600 ml at 17.0 mg/ml) was discarded and the resultant pellet (hereinafter in this Example P1) 
from this centrifugation was resuspended in approximately 400 ml buffer A, using glass rods and a 
magnetic stirrer. The milky suspension was then passed through an 18 gauge needle using a 60 ml syringe. 

35 The final volume was 640 ml, with a protein concentration of 25.6 mg/ml. 

The resuspended P1 pellet was then centrrfuged for 10 minutes in a Sorval centrifuge with a GS-3 rotor 
at 8000 rpm. The supernatant from this centrifugation was poured into two fresh centrifuge tubes 
(hereinafter in this example "S2") and the resultant pellet ("P2") was resuspended in buffer A to a final 
volume of 165 ml, with a protein concentration of 50 mg/ml. The S2 supernatant was then centrrfuged for 10 

40 minutes and the resulting pellet ( m P3 m ) was resuspended in 65 ml buffer A. The resulting supernatant ("S3") 
was further centrrfuged for 10 minutes and the resulting P4 pellet was resuspended in buffer A to a final 
volume of 50 ml, with a protein concentration of 11.3 mg/ml. Because the P4 pellet contained so little IL-3, it 
was not used in subsequent steps. The S4 supernatant from the final centrifugation, approximately 600 ml. 
contained the membranous components at a concentration of approximately 10 mg/ml. 

45 The P2 and P3 pellets were pooled and centrrfuged at 9000 rpm (GSA rotor) for 10 minutes yielding 
two pellets f P2-2") and a cloudy supernatant, which using HPLC analysis was found void of IL-3 and was 
discarded. The P2-2 pellet was frozen at -20 degrees centigrade for later use. 

The frozen P2-2 pellet was then resuspended in buffer A (which contained 10 mM DTT rather than 2 
mM DTT) to a final volume of 100 ml using glass rods and magnetic stiner and then passed through an 18 

w gauge needle. 400 ml of 7 M fresh guanidine in the 10 mM DTT buffer A was added to the resuspended 
P2-2 and after one quick inversion, the solubilized P2-2 pellet was immediately placed in 3 x 250 centrifuge 
tubes and centrrfuged for 15 minutes at 8000 rpm (GSA rotor). 500 ml of the supernatant at a concentration 
of 5.98 mg/ml was purified further at room temperature by RP-HPLC. The foregoing two steps were 
performed in 17 to 22 minutes. 

55 This purification protocol may be applied to the purification of the G-CSF and EPO CAVs expressed in 
E. coli as set forth above with similar results. 
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2. RP-HPLC separation of IL-3 CAVs 

The buffers used in this separation protocol were 0.1% <v/v) TFA in water, and 0.1% TFA in acetonitrile. 

A two inch Vydac C4 column was equilibrated in 10% acetonitrile. The supernatant from the 7 M 
guanidine solubilization was immediately applied onto the C4 column having a volume of approximately 470 
ml at 180 ml per minute. The column was developed at 20 ml per minute and was washed in 10% 
acetonitrile until absorbance at 280 nm was back to baseline. The following gradient was established by 
washing with the following concentrations of acetonitrile at the following times: 



time (in minutes) 


% acetonitrile 


5 


10 


10 


35 


55 


55 


60 


80 


65 


80 


67.5 


10 



40 ml fractions were collected after 35 minutes into the gradient. 10 ul samples were removed from 
each fraction, vacuum speed dried and taken up in 20 ul of 2x SDS-sample Laemmli buffer. SDS-PAGE 
analysis was performed and IL-3 presence was confirmed. All fractions were then frozen at-80 degrees 
centigrade 

Similarly, RP-HPLC separation of G-CSF and EPO CAVs may be accomplished. 



3. Refolding of IL-3 CAVs 

One of the RP-HPLC separated fractions containing approximately 75 mg (7.5 ml) IL-3 was diluted to 
approximately 0.5 mg/ml by the addition of 142.5 ml of 6.4 M guanidine in 50 mM NaPO* pH 7.0, 1 mM 
EDTA and 02 mM DTT. Tne mixture was then added to 750 ml of 50 mM NaPG* pH7, 1 mM EDTA, 02 
mM DTT buffer, transferred to dialysis tubing and dialyzed for two hours against 4 L of the same buffer. 
The IL-3 (now approximately 022 M guanidine) was twice further dialyzed against 8 L of the same buffer 
containing 0.1 mM DTT. 

PEGylation of this purified IL-3 is set forth in Examples 7 and 8 below. 

Refolding of G-CSF and EPO CAVs may be accomplished in the same manner. 



4. Confirmation of bioacrjvity 

Bioactivfty of IL-3 CAVs or EPO CAVs may be confirmed by using the TF-1 cell proliferation assay. The 
TF-1 cell line has been described (Kitamura et al., Blood (1989) 73:375-380). 

In that assay, cells are maintained at 37 degrees centigrade in humid air containing 5% CCfc and 
culture media used is RPMI (Gibco). 10% heat inactivated fetal calf serum. 2 mM L-giutamine. with 5 ng/rnl 
of recombinant GM-CSF added. Every 3^4 days cells are adjusted to a density of 2 x 10 s cells/ml. Just 
prior to assay the cells are centrifuged 500xG, 5 minutes, washed in culture media without rGM-CSF, 
recentrifuged and resuspended at a density of 10 s cells/ml. 

IL-3 or EPO samples to be assayed are diluted between 1:500 and 1:10,000 in culture media without 
rGM-CSF 125 ul of the diluted sample is placed in the top row of a 96 well microliter plate. The remaining 
wells are filled with 100 ul of culture media without rGM-CSF and the top row samples are serially diluted 
five fold down the microliter plate. To each well, 100 ul of diluted cells (10* cells) are added and the plate 
is incubated at 37 degrees centigrade, 5% C° 2 for 48-72 hours. Thereafter. 0.5 uCi *H-thymidine is added 
per well and the plate is further incubated for 4-6 hours. Cells are then harvested using an automated cell 
harvester (LKB 1295-001) and the 3 H-thymidine uptake is quantitated. 

Alternatively for IL-3, a CML proliferation assay as described in PCT/US87/017024, International 
Publication Number WO88/00598, published January 28, 1988. can be used. 

Bioactivity of G-CSF CAVs may be confirmed using the 32D murine cell line, as described in Hapel. et 
al Blood (1984) 64:786-790. and adding 5% v/v of WEHI-3 conditioned media from a 48 hour culture of 
WEHP3cells ATCC TIB68, (1x10* cells/ml) in RPMM640 media supplemented with 2 mM L-glutamine 
instead of the 5 ng/ml recombinant GM-CSF in the TF-1 cell proliferation protocol. The wash and assay 
steps are then carried out in the absence of WEHI-3 conditioned media. 
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EXAMPLE 4: Mutagenesis Protocol 

Site directed mutagenesis may be effected using conventional procedures known in the art. See e.g., 
Internationa! Applications Nos. WO 87/07144, and WO 87/04722, and US Patent Application Serial Nos. 
s 099,938 (filed September 23, 1987) and 088,188 (filed August 21, 1987) and the references cited therein. 

EXAMPLE 5: Exemplary Mutagenesis Reactions 

The following human IL-3. G-CSF and EPO muteins were engineered by substitution of the codons 
10 indicated for a cys codon. or by insertion of a cys codon, using conventional site directed mutagenesis 
techniques: 
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IL-3 



hid mutein 

mpCyslO 

mpCys6 

mpCys8 

mpCysl2 

inpCyslOO 

mpCysl34 

mpCys3 
mpMCysl9 



m3 mute in 

m3cys!0 

m3Cys6 

m 3 Cys 8 

m3Cysl2 

m3Cysl00 

m3Cysl34 



m3Cys6,10 

m3Cys9,10 
m3Cys6,8 



cys modification 

AAA to TGC (Lys to Cys) 

ACT to TGC (Thr to Cys) 

TCT to TGC (Ser to Cys) 

TCT to TGC (Ser to Cys) 

AAG to TGT (Lys to Cys) 

Insertion of TGT between 
TTC and TAG (Cys between 
Phe 133 and stop codon) 

ATG to TGC (Met to Cys) 

Replacement of amino acids 
1-15 with the M mp H 
terminus and modif. of 
pos. 19 from ATG to TGC 
(Met to Cys) 

ACT and AAA to 
TGC (Thr and Lys 
to Cys) 



TTA and AAA to TGC 
and Lys to Cys) 



(Leu 



ACT and TCT to TGC (Thr 
and Ser to Cys) 

m3Cys6 # 8,10 ACT, TCT and AAA to TGC 
(Thr, Ser and Lys to Cys) 

m3Cys8,9,10 TCT, TTA and AAA to TGC 
(Ser, Leu and Lys to Cys) 



G-CSF 



mp routein 
mpAlal7Cys37 



cys modification 

GCC to TGC (Ala to Cys) 



EPO 



mp mute in 
mpCys9 



cys modification 
TCT to TGT (Ser to Cys) 



In the examples depicted above the modification site of the natural IL-3, G-CSF or EPO protein is 
designated by the number after "Cys" and the amino acid sequence of the CAV is identical to that of the 
native protein, except for the position indicated, with respect to the N-terminus (see Bg. 1). The "mp" and 
"m3" designations signify the two different alterations of the N-terminus that will be discussed in detail 
below. Additionally, cys may be introduced in place of native IL-3 codons. for example at positions 63 or 66, 
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70 



75 



20 



25 



30 



35 



alone or in combination with other cys introduction(s). e.g. at position 10-with any of the described N- 
termini Contemplated EPO muteins include EPO mpCys166, having the mp N-terminus and the native 
arginine at position 166 deleted and replaced with cysteine, and mpCys24Cys38Cys83. having the mp N- 
terminus and the three N-linked glycosylation sites at the asparagine amino acids replaced with cysteines. 

With respect to IL-3 muteins. certain point modifications may result in partial loss of biological activity 
or inability of the sulfhydryl reactive compound to attach. For example, modification at position 28 results in 
a biologically active CAV, but attachment of a sulfhydryl reactive compound fails, possibly because position 
28 appears internally in the refolded CAVs tertiary structure. Compare. WingfieW. D.. et aL, Eur. J. Biochem. 
(1989) 179:565-571, in which the authors discussed the Cys modification of IL-10 at position 138 to active 
IL-1/S-phycoerythrin conjugate. Additionally, we have found that substitution of a cysteine residue for the 
amino acids at positions 15 or 51 of the natural human IL-3 may result in partial loss of bioactivity. To test 
for activity after attachment of the sulfhydryl reactive compound, this invention further provides a "small 
scale" screening technique to readily determine whether modification and attachment has been successful 
(see Example 9 below). 

The human IL-3 was additionally modified at its N-terminus in two different and alternative configura- 
tions, represented by the "mp" and "m3" designations. The "mp" designation indicates a deletion of the 
first alanine in the natural human IL-3 protein, thereby changing the N-terminal sequence from 
MET*ALA*PRO to METTRO. The "m3" designation indicates a deletion of the first two amino acids in the 
natural human IL-3 protein. METALA*PRO, to yield a terminus beginning MET"THR*GLU"THR\ The reasons 
for these modifications have already been discussed. With respect to N-terminus modification of the IL-3 
mpA1Cys19 mutein, amino acids 1-15 were deleted and replaced with the "mp" terminus. The human G- 
CSF and EPO muteins were additionally modified at the N-terminus by deletion of the first amino acid to 
obtain "mp" muteins. 

rt should be understood of course that the depicted list of muteins is merely exemplary and not 
exclusive. The design and synthesis of alternative and additional muteins in accord with this invention is 
well within the present skill in the art. Synthesis of such muteins may be conveniently effected using 
conventional techniques and methods. 

One skilled in the art, of course, could readily design and synthesize other muteins for substitution of 
cysteine codons or insertion thereof in DNA sequences encoding IL-3. G-CSF and EPO. To modify more 
than one site, mutagenesis may be carried out iteratively, or in some cases using an oligonucleotide 
designed for mutagenesis at more than one site. 

EXAMPLE 6: Synthesis of DNA molecules encoding CAVs 

As an alternative to the production of CAV-encoding DNA by mutagenesis of the parental DNA 
sequence, it should be understood that the desired CAV-encoding DNA may be prepared synthetically. In 
that case, it will usually be desirable to synthesize the CAV DNA in the form of overlapping 
oligonucleotides, e.g. overlapping 50-80mers, which together span the desired coding sequence and contain 
the cysteine additions desired: 



40 



An exemplary EPO mutein made in accordance with this Example is EPO mpCys9. The chemically 
synthesized cDNA shown in Figure 3 is assembled, modified at position 9. and purified using techniques 
know to those skilled in the art See Wosnick, et aL. Gene 60 :115-127 (1987); see also U.S. Patent No. 
so 4,904,584. The synthetic cDNA is designed with "overhanging end" nucleotide sequences compatible with 
those generated by the restriction enzymes. Ndel and Xbal. The purified, synthetically derived, cDNA is 
ligated with the purified Ndel-XbaJ vector portion of plasmid pAL-hlL3-781 , resulting in the replacement of 
human IL-3 cDNA with human erythropoietin cDNA EPOmpCys166 and EPO mpCys24Cys38Cys83 can be 
made in the same manner. 

55 Given a desired coding sequence, the design, synthesis, assembly and ligation, if desired, to synthetic 
linkers of other appropriate oligonucleotides is well within the present level of skill in the art 
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10 



15 



EXAMPLE 7: PEGvlation of the IL-3 mpCyslO mutein 

The mutein human IL-3 mpCyslO was prepared in accordance with Example 5 above and PEGylatod 
with two PEG 5000 derivatives, S-Pyridyl Monomethoxy PEG 5000 (PEG 5000 SPDP) and Male.m.do 
Monomethoxy PEG 5000 (PEG 5000 SMCC). 

1. PEGvlation with S-Pyridyl Monomethoxy PEG 5000: a red ucible linkage 
a.) Preparation of the sulfhydryl reactive compound. 

PEG 5000 was activated for attachment to a sulfhydryl group as follows. 2.0 grams of Monomethoxy 
PEG 5000 amine was dissolved in 12 ml dry peroxide free, dioxane. 144 mg (15% excess) of N- 
succinimidyl-3-(2 pyridyldithio) propionate (SPDP) was added as a dry powder and the reaction was allowed 
to proceed at room temperature. After 24 hours, the S-pyridyl Monomethoxy PEG 5000 product was 
precipitated using dry. peroxide free diethyl ether and washed with ether. The product was dried under 
vacuum to obtain 1.92 grams of white solid, which was identified as S-Pyridyl Monomethoxy PEG 5000 by 
NMR and IR. The PEG 10,000 analog (PEG 10,000 SPDP) was likewise prepared via an analogous 
procedure. 

20 b.) PEGvlation of mutein C10 human IL-3. 

For this coupling, natural (wild type) human IL-3 was also treated with the PEGylation reagents as a 
neqative control. A stock solution at 1 mg/ml of the mpCyslO mutein in a pH 7 buffered solution of 50 mM 
NaH 2 PO* 100 micro M DTT. 1 mM EDTA and about 3mM Guanidine HCI was used. DTT was added to 
prevent dimerization of the protein; EDTA was added to prevent dimerization via metal mediated oxidative 
coupling. Guanidine remains as an artifact of the refolding of the protein. A pH 7 was used; a range of 6.5- 
7 5 is preferred. 0.9 mg of S-Pyridyl Monomethoxy PEG 5000. prepared as set forth above, was weighed 
into an Eppendorf tube. 360 microliters of the buffered mutein was added and the mixture was vortexed 
briefly to homogeneity. The reaction was performed at 4 degrees centigrade and when sampled after 2 
hours was found to be complete. Analysis on a 10-20% gradient SDS acrylamide gel stained with 
Coomassie blue showed the product as nearly pure and running at about 28 kD. (By comparison. mpCyslO 
and its dimer were used as standards and found to migrate to 15 and 30 kD respectively.) A reducing lane 
on the gel showed that the PEGylated IL-3 mutein is sensitive to reduction by DTT and regenerated the 
original protein at about 15 kD. 



25 



30 



35 



2. PEGvlation with Maleimido Monomethoxy PEG 5000: a non-reducible linkage 

a. ) Preparation of the sulfhydryl reactive compound. 

40 In this experiment PEG 5000 activation was accomplished as follows. 2.0 mg of monomethoxy PEG 
5000 amine was dissolved in 12 ml of dry. peroxide free dioxane. 154 mg of sulfosuccinimidyl 4-<N- 
maleimidomethyl) cyclohexane-1-cart)oxylate (SMCC) (15% excess) was added as a dry powder and the 
reaction was allowed to proceed at room temperature. After 24 hours, work up of the product was earned 
out in the same manner as the S-Pyridyl Monomethoxy PEG 5000 to obtain 1.82 grams Maleimido 

45 Monomethoxy PEG 5000. The PEG 1 0.000 analog was prepared similarly. 

b. ) PEGylation of the cyslO IL-3 mutein. 

-mis reaction was carried out In the same manner as the PEGylation reaction using the reducible PEG 
so 5000 reagent and with natural human IL-3 as a negative control. However, 1.0 mg of the PEG derived 
PEGylating agent Maleimido Monomethoxy PEG 5000 was used. 400 microliters of the mpCyslO IL-3 
mutein was added and vortexed to homogeneity. At 1=2 hours the reaction was found to be complete. The 
product was nearly pure and indistinguishable from the S Pyridyl derived conjugate in molecular weight 
However, this product is perfectly inert to reductive conditions, such as DTT; in this reducing lane the 

55 product at 28 kD. persists. 

In both control reactions, nothing indicative of conjugation is evident at 2 hrs or even at 24 nrs. 
Selectivity for accessible suKhydryls in this chemistry is therefore very high. 
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EXAMPLE 8: PEGylation of multiple cysteine muteins m3Cys9,10 and m3Cys6,10 

In this experiment, protein stock for both muteins was at 300 ug/ml in the phosphate buffer solution, as 
described in Example 7. PEGylation stock solutions consisted of the S-pyridyl or Maleimide activated PEG 

5 5000 polymers at 50 ug/ml in the same buffer. To initiate the reaction. 11 ul of the appropriate PEG stock 
was added to 100 ul of the appropriate protein stock (either the m3Cys9.10 mutein or the m3Cys6.10 
mutein) while vortexing. Reactions were allowed to proceed at 4 degrees centigrade overnight. SDS gel 
analysis of the products as described above revealed that only a trace of starting material remained with 
both chemistries. Furthermore, both chemistries resulted in new products with a gel mobility of about 37 kD. 

w Reducing lanes on this same gel show that the maleimide conjugate is resistant to reducing, while the S- 
Pyridyl derived conjugate reverts to starting material. 

EXAMPLE 9: PEGylation of the G-CSF mpAla17Cys37 mutein 

75 The mutein human G-CSF mpAla17Cys37 was prepared in accordance with Example 5 above and 
PEGylated with PEG 5000 SPDP. PEG 5000 SMCC and PEG 10.000 SPDP. The natural cysteine at position 
17 was deleted and replaced with alanine to prevent possible improper disulfide bridge formation. The 
mpAla17Cys37 G-CSF protein stock was dialyzed into a pH 7.0 buffering solution of 50mM NahfePO* and 
1mM EDTA, 100uM DTT and concentrated to approximately 100ug/ml. Total volume was 670 ul. 

20 Stocks of the PEGylation reagents (PEG 5000 SPDP, PEG 5000 SMCC. and PEG 10,000 SPDP) were 
made up fresh at 10mM in H2O (18 x stock solutions prepared as described in Examples, parts la and 
1b). The reactions were carried out using the reagents and amounts set forth below. 
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Rxn# 


PEG Reagent 


Stock 


Ul Protein 


Ul PEG 


1 


5000 SPDP 


170 


10 


2 


5000 SMCC 


170 


10 


3 


10.000 SPDP 


170 


10 



All reactions were 1mM in PEG reagent As in Example 7. the mutein was added to the PEG reagent 
and vortexed to homogeneity. The reactions were allowed to proceed at 4°C overnight and found to be 
. complete. 

35 Analysis on a 10-20% gradient SDS acrylamide gel with both reducing and non-reducing lanes and 

stained with Coomassie blue showed the mpAla17Cys37 PEG 5000 SPDP and mpAla17Cys37 PEG 5000 

SMCC products as nearly pure and running at about 28 kD. 

Non-PEGylated G-CSF mute in runs at approximately 19kD. A reducing lane on the gel showed that the 

SPDP conjugate is sensitive to reduction in the presence of DTT. The SMCC derived reagent was totally 
40 resistant to reductive treatment The PEG 10,000 SPDP PEGylated mpAla17Cys37 G-CSF runs at 

approximately 32 kD. The non-PEGylated G-CSF mutein is regenerated by DTT treatment 



EXAMPLE 10: PEGylation of the EPO mpCys9 mutein 

45 The mutein human EPO mpCys9 prepared in accordance with Example 5 above can be PEGylated 
using the stock PEGylation reagents prepared In Example 9 and the natural human EPO as a negative 
control. To initiate the reaction, 10 ul of the appropriate PEG stock as prepared in Example 9 is added to 
170 ul of the EPO mpCys9 stock at a concentration of lOOug/ml. The mixture is vortexed and the reaction 
is allowed to proceed at 4°C overnight Upon completion, the reaction can be analyzed as in Example 9. 

50 

EXAMPLE 11: Screening of novel CAVs 

Having the constructed novel DNA molecules encoding CAVs in the appropriate expression vector and 
having attached the sulfhydryl reactive compound to the muteins, it may be desirable to produce each CAV 
55 protein on a small scale and "screen" for muteins which possess the desired attachment site or sites. The 
biological activity of each CAV. before and after attachment of the sulfhydryl reactive compound can be 
rapidly assessed using an in vitro assay. 
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Small scale bacterial production of IL-3 CAV muteins 

Bacterial strain GI586 was transformed with purified plasmid DNAs consisting of bacterial expression 
vector pAL-hlL3-781. ATCC Accession Number 67932, with novel CAV IL-3 coding sequences. The 
transformed cells were spread on LB agar plates containing 50 ug/ml ampicillin at a density to yield 
approximately 100 colonies per plate. 3 ml of L broth plus 50 ug/ml ampicillin was inoculated with a single 
bacterial colony and grown overnight at 30 degrees centigrade. 50 ml of induction media (O.lx L Broth, 
1xM9 salts. 0.4% glucose, 1mM MgS0 4 . 50 ug/ml ampicillin) was inoculated with 1 ml of the overnight 
culture. The 50 ml culture was grown with aeration at 30 degrees centigrade until an 0.5 OD 600 nm level 
was reached, then the temperature was shifted to 40 degrees centigrade and growth continued for at least 2 
hours. 

Cells were then harvested by centrifugation at 3500 rpm for 5 minutes in a Sorval centrifuge with a 3B 
rotor. The supernatant was discarded and the cell pellet resuspended in 1 ml of buffer PED (50 mM 
NaH2P04, pH 7.0, 1 mM EDTA, 5mM DTT, 10 mM PMSF). This 1 ml solution was passed twice through a 
French Press at 10,000 psi and kept on ice. The solution was then microfuged for 5 minutes at 12,000 rpm. 
The supernatant was discarded and the pelleted material was resuspended in 150 ul of 7 M guanidine-HCI 
in PED buffer. The solution was then diluted with 650 ul of PED buffer and placed in dialysis tubing (10,000 
MWCO Spectrapore). The sample was dialyzed for at least 4 hours against 2 liters of PED.1 buffer (50 mM 
NaH2P04 pH 7.0, 1 mM EDTA, 0.1 mM DTT). The sample was collected and microfuged to remove 
precipitated proteins. The sample was then analyzed on a 12% Laemmli SDS PAGE gel and the amount of 
IL-3 protein estimated. 

The protein solution was then concentrated to about 0.5 mg/ml. and 200 ug was reacted with a 15 fold 
molar excess of either S-Pyridyl Monomethoxy PEG 5000 or Maleimido Monomethoxy PEG 5000 for 
several hours at 4 degrees centigrade. The products were then analyzed by SDS PAGE and biological 
activity determined by an in vitro TF-1 cell proliferation assay. 

This small scale production methodology may be similarly advantageously applied to production of the 
novel CAV G-CSFs and EPOs of the present invention. 

Alternatively, this small scale production for screening may be carried out before attachment of the 
sulfhydryl reactive compound. In that case, biological activity may still be determined by an in vitro TF-1 
cell proliferation assay (or in the case of G-CSF, a 32D cell proliferation assay) and the products may be 
analyzed by SDS PAGE analysis, in accordance with known techniques. 

The same or similar procedures may be used by one skilled in the art to attach other sulfhydryl reactive 
compounds to the other CAVs of the invention. Homogeneity can be observed by conventional analysis of 
the modified CAVs so produced e.g. using standard SDS-PAGE or HPLC analysis. 

Numerous modifications may be made by one skilled in the art to the methods and compositions of the 
present invention in view of the disclosure herein. Such modifications are believed to be encompassed by 
this invention as defined by the appended claims. 

Claims 

1. A cysteine added variant ("CAV") of EPO, characterized in that said CAV comprises a peptide 
sequence of human EPO modified to contain at least one non-native cysteine residue at which residue 
said CAV is covalently attached to at least one sulfhydryl reactive compound. 

2. A CAV of claim 1, characterized in that the N-terminus of the CAV commences with methionine and the 
native amino acid at position 1 of the mature peptide sequence is deleted. 

3. A CAV of any of claims 1 to 2, characterized in that said at least one sulfhydryl group is selected from 
one or more of the group consisting of dextran, a carbohydrate based polymer, such as a colominic 
acid, a polymer of an amino acid, biotin, and a polyalkylene glycol moiety. 

4. A CAV of any of claims 1 to 2, characterized in that said at least one sulfhydryl reactive compound is 
polyethylene glycol. 

5. A DNA sequence encoding a CAV comprising a peptide sequence of human EPO modified to contain 
at least one non-native cysteine residue for attachment of at least one sulfhydryl reactive compound. 

6. A host cell containing and capable of expressing a DNA sequence of claim 5. 



16 



EP 0 668 353 A1 



7. A method of producing a CAV of any of claims 1 to 4, comprising covalently attaching at least one 
sulfhydryl reactive compound to at least one non-native cysteine of said CAV produced by culturing a 
host cell containing and capable of expressing a DNA sequence encoding said CAV. 

5 8. Use of a CAV of any of claims 1 to 4 for the preparation of pharmaceutical compositions suitable for 
stimulating hematopoiesis. 
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FIGURE 1 



1 10 
ATG GCT CCT ATG ACT CAA ACT ACT TCT TTA AAA ACT TCT 
Met Ala Pro Met Thr Gin Thr Thr Ser Leu Lys Thr Ser 

25 

TGG GTA AAC TGT TCT AAC ATG ATC GAT GAA ATT ATA ACA 
Trp Val Asn Cys Ser Asn Met He Asp Glu He He Thr 



CAC TTA AAG CAG CCA CCT TTG CCC TTG CTG GAC TTC AAC 
His Leu Lys Gin Pro Pro Leu Pro Leu Leu Asp Phe Asn 

40 

AAC CTC AAT GGG GAA GAC CAA GAC ATT CTG ATG GAA AAT 
Asn Leu Asn Gly Glu Asp Gin Asp He Leu Met Glu Asn 

55 

AAC CTT CGA AGG CCA AAC CTG GAG GCA TTC AAC AGG GCT 
Asn Leu Arg Arg Pro Asn Leu Glu Ala Phe Asn Arg Ala 

70 

GTC AAG AGT CTG CAA AAT GCA TCA GCA ATT GAG AGC ATT 
Val Lys Ser Leu Gin Asn Ala Ser Ala He Glu Ser He 

85 

CTG AAA AAT CTG CTG CCA TGT CTG CCC CTG GCC ACA GCT 
He Lys Asn Leu Leu Pro Cys Leu Pro Leu Ala Thr Ala 

100 

GCA CCC ACC AGG CAT CCA ATC CAT ATC AAG GAT GGT GAC 
Ala Pro Thr Arg His Pro He His He Lys Asp Gly Asp 

115 

TGG AAT GAA TTC CGC CGC AAA CTG ACC TTC TAT CTG AAA 
Trp Asn Glu Phe Arg Arg Lys Leu Thr Phe Tyr Leu Lys 



ACC CTG GAG AAT GCT CAG GCT CAG CAG ACC ACC CTG AGC 
Thr Leu Glu Asn Ala Gin Ala Gin Gin Thr Thr Leu Ser 

130 

CTC GCG ATC TTC TAG 
Leu Ala He Phe Stop 
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FIGURE 2a 



1 10 



ATG 
MET 



ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC 
Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe 

20 

CTG CTC AAG TGC TTA GAG CAA GTG AGG AAG ATC CAG GGC 
Leu Leu Lys Cys Leu Glu Gin Val Arg Lys lie Gin Gly 

30 

GAT GGC GCA GCG CTC CAG GAG AAG CTG TGT GCC ACC TAC 
Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr Tyr 

40 50 
AAG CTG TGC CAC CCC GAG GAG CTG GTG CTG CTC GGA CAC 

Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His 

60 

TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC CCC 
Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro 

70 

AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC 
Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu 



80 



90 



CAT AGC GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC 
His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala 

100 

CTG GAA GGG ATC TCC CCC GAG TTG GGT CCC ACC TTG GAC 
Leu Glu Gly He Ser Pro Glu Leu Gly Pro Thr Leu Asp 

110 

ACA CTG CAG CTG GAC GTC GCC GAC TTT GCC ACC ACC ATC 
Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr He 

120 130 
TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT GCC CTG 
Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu 

140 

CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT 
Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala 

150 

TTC CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT 
Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His 
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FIGURE 2b 



160 

CTG CAG AGC TTC CTG GAG GTG TCG TAC CGC GTT CTA CGC 
Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg 



170 174 
CAC CTT GCC CAG CCC T 
His Leu Ala Gin Pro 
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FIGURE 3a 
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10 




Met 


Ala 


Pro 


Pro 


Arg 


Leu 


lie Cys Asp 


Ser Arg 


Val 


5 1 ATATG 


GCA 


CCA 


CCA 


AGA 


TTA 


ATT TGT GAT 


TCT AGA 


GTA 


TAC 


CGT 


GGT 


GGT 


TCT 


AAT 


TAA ACA CTA 


AGA TCT 


CAT 



Leu 
TTA 
AAT 


Glu 
GAA 
CTT 


Arg 
CGG 
GCC 


Tyr 
TAC 
ATG 


Leu 
CTC 
GAG 


Leu 
TTG 
AAC 


25 
He 
ATC 
TAG 


Thr 
ACG 
TGC 


Thr 
ACG 
TGC 


Gly 
GGC 
CCG 


Cys 
TGT 
ACA 


Ala 
GCT 
CGA 


Asn 
AAT 
TTA 


He 
ATC 
TAG 


40 
Thr 
ACT 
TGA 


Val 
GTC 
CAG 


Pro 
CCA 
GGT 


Asp 
GAC 
CTG 


Trp 

TGG 
ACC 


Lys 
AAA 
TTT 


Arg 
AGA 
TCT 


Met 
ATG 
TAC 


55 
GlU 
GAA 
CTT 


Val 
GTT 
CAA 


Trp 
TGG 
ACC 


Gin 
CAG 
GTC 


Gly 
GGA 
CCT 


Leu 
TTA 
AAT 


Ala 
GCG 
CGC 


Leu 
CTA 
GAT 


Gly 
GGT 
CCA 


Gin 
CAG 
GTC 


Ala 

GCT 
CGA 


Leu 
TTA 
AAT 


Leu 
TTA 
AAT 


Val 
GTC 
CAG 


Pro 
CCC 
GGG 


Leu 
CTG 
GAT 


Gin 
CAG 
GTC 


Leu 
CTG 
GAC 


His 
CAT 
GTA 


Val 
GTG 
CAC 


Arg 
CGC 
GCG 


Ser 
AGC 
TCG 


Leu 
CTC 
GAG 


Thr 
ACC 
TGG 


Thr 
ACT 
TGA 


Leu 
CTG 
GAC 



Glu Ala Lys Glu Ala Glu Asn 
GAG GCC AAG GAG GCC GAG AAT 
CTC CGG TTC CTC CGG CTC TTA 



Glu His Cys Ser Leu Asn Glu 
GAA CAC TGC AGC TTG AAT GAG 
CTT GTG ACG TCG AAC TTA CTC 



Thr Lys Val Asn Phe Tyr Ala 
ACC AAA GTT AAC TTT TAC GCG 
TGG TTT CAA TTG AAA ATG CGC 



Gly Gin Gin Ala Val Glu Val 
GGC CAG CAG GCT GTA GAA GTA 
CCG GTC GTC CGA CAT CTT CAT 

70 

Leu Ser Glu Ala Val Leu Arg 
TTA AGT GAA GCT GTT CTC CGC 
AAT TCA CTT CGA CAA GAG GCG 

85 

Asn Ser Ser Gin Pro Trp Glu 
AAC TCT TCC CAG CCG TGG GAG 
TTG AGA AGG GTC GGC ACC CTC 

100 

Asp Lys Ala Val Ser Gly Leu 
GAT AAA GCC GTC AGT GGC CTT 
CTA TTT CGG CAG TCA CCG GAA 

115 

Leu Arg Ala Leu Gly Ala Gin 
CTT CGG GCT CTG GGA GCC CAG' 
GAA GCC CGA GAC CCT CGG GTC 
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FIGURE 3b 



Lys Glu Ala He Ser Pro Pro Asp Ala Ala Ser Ala Ala 
AAG GAA GCC ATC TCC CCT CCA GAT GCG GCC TCA GCT GCT 
TTC CTT CGG TAG AGG GGA GGT CTA CGC CGG AGT CGA CGA 

130 

Pro Leu Arg Thr He Thr Ala Asp Thr Phe Arg Lys Leu 
CCA CTC CGA ACA ATC ACT GCT GAC ACT TTC CGA AAA CTC 
GGT GAG GCT TGT TAG TGA CGA CTG TGA AAG GCT TTT GAG 

145 

Phe Arg Val Tyr Ser Asn Phe Leu Arg Gly Lys Leu Lys 
TTC CGA GTC TAC TCC AAT TTC CTC CGG GGA AAG CTG AAG 
AAG GCT CAG ATG AGG TTA AAG GAG GCC CCT TTC GAC TTC 

160 

Leu Tyr Thr Gly Glu Ala Cys Arg Thr Gly Asp Arg 
CTG TAC ACA GGG GAG GCC TGC AGG ACA GGG GAC AGA 
GAC ATG TGT CCC CTC CGG ACG TCC TGT CCC CTG TCT 

STOP 

TAA TAATGATAGGATCCT 

ATT ATTACTATCCTAGGAGATC - 5' 
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